Corpus-based metrics for assessing communal common ground

Authors

  • Roman Kutlák
  • Kees van Deemter
  • Chris Mellish
Abstract

This article presents the first attempt to construct a computational model of common ground. Four corpus-based metrics are presented that estimate what facts are likely to be in common ground. The proposed metrics were evaluated in an experiment with human participants, focussing on a domain of famous people. The results are encouraging: two of the proposed metrics achieved a large positive correlation between the estimates of how widely known a property of a famous person is and the percentage of participants who knew the corresponding property.

Similar resources

Content Selection Challenge - University of Aberdeen Entry

Bouayad-Agha et al. (2012) issued a content determination challenge in which researchers were asked to create systems that can automatically select content suitable for a first paragraph in a Wikipedia article from an RDF knowledge base of information about people. This article is a description of the system built at the University of Aberdeen. Our working assumption is that the target text sho...

The suitability of common metrics for assessing parotid and larynx autosegmentation accuracy

Contouring structures in the head and neck is time-consuming, and automatic segmentation is an important part of an adaptive radiotherapy workflow. Geometric accuracy of automatic segmentation algorithms has been widely reported, but there is no consensus as to which metrics provide clinically meaningful results. This study investigated whether geometric accuracy (as quantified by several comm...

Ground Truth, Reference Truth & “Omniscient Truth” -- Parallel Phrases in Parallel Texts for MT Evaluation

Recently introduced automated methods of evaluating machine translation (MT) systems require the construction of parallel corpora of source language (SL) texts with human reference translations in the target language (TL). We present a novel method of exploiting and augmenting these resources for task-based MT evaluation, assessing how accurately people can extract Who, When, and Where elements...

Combined Mapping of Multiple clUsteriNg ALgorithms (COMMUNAL): A Robust Method for Selection of Cluster Number, K

In order to discover new subsets (clusters) of a data set, researchers often use algorithms that perform unsupervised clustering, namely, the algorithmic separation of a dataset into some number of distinct clusters. Deciding whether a particular separation (or number of clusters, K) is correct is a sort of 'dark art', with multiple techniques available for assessing the validity of unsupervise...

3d Landscape Metrics to Modelling Forest Structure and Diversity Based on Laser Scanning Data

This paper investigates the potential of laser scanning data to model forest 3d structure and its spatial pattern. Most of the existing methods for assessing structures in mountain forests are either inventory methods, which cannot be used for spatial assessments over large areas, or methods aimed only at assessing actual wood production. Several new landscape metrics are developed and applied ...

Publication date: 2012